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PREFACE; In Praise of Bugs 

Two summers ago, at the Ann Arbor third meeting on Algebraic Combinatorics (June 1994), 
and also at the Taormina conference in honor of Adriano Garsia (Aug. 1994), George Andrews 
described, rather dramatically, some apparent shortcomings of the WZ method (see [PWZ] for a 
description of this method). At one moment, he displayed the printout of the computer-generated 
proof of a certain hypergeometric identity, that was outputted by the Maple package EKHAD (that 
now accompanies [PWZ]). This necessitated 14 transparencies, taped together, which Andrews 
rolled down on the floor, producing an obvious comic effect. 

The identity was ((1.6) of [Al]): 

/_2,-l,x + 2i + 2,x-z + l/2,x + i + l,z + i + l \ 

(x + l)/2,:r/2 + l,2z + 2z + 2,2x-2z + l 'V" ' ^ ^ 

where we use the standard hypergeometric notation (see [GKP],[PWZ]). 

In fact, the printout that was displayed was not even the proof of the identity itself, that was 
beyond the capabilities of EKHAD at the time, but the proof of a certain special case, obtained by 
specializing the parameter z. 

As it turned out, EKHAD's failure to give a complete proof of (MRR) at the time, as well as 
the length of the proof of the special case, were not the algorithm's fault, but were due to a 
typo in the Maple coding of the second author'^. Once this typo was corrected, the proof of 
(MRR) that EKHAD outputs is rather short (it fits in less than one transparency) , and is produced 



relatively fast. Both the input and output files can be viewed in pttp : //www .math, temple . edu/" 
zeilberg/synd.html, the present paper's Web Page. 

This was a lucky mistake, since it lead Andrews to develop an elegant new technique, that he dubbed 
'Pfaff's method' ([Al-3]). Using this new technique, he not only proved {MRR), but in the process 
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It escaped notice before because it occurred in a rarely visited part of the program. 



was naturally lead to the discovery and proof of nineteen additional brand new hypergeometric 
identities. These identities would never have been discovered had EKHAD been perfect two years 
ago. 

After the current version of EKHAD effortlessly proved identity (MRR), we were sure that it 
would just as easily prove all these other identities. This was implicit in the second author's talk in 
Herb Wilf's birthday conference, that took place in Philadelphia, June 12-15, 1996. When George 
Andrews asked us whether these new identities are equally provable by EKHAD, in real time, we 
replied that while we did not try yet, we were sure that they would pose no problem. 

Imagine our surprise, and slight embarrassment, when the new, improved, and debugged, version 
of EKHAD ran out of memory in each and every one of Andrews's twenty new identities, except for 
two (the original (4.2) (which is (AIRR)) and (4.20) of [Al]). When one specializes the auxiliary 
parameters x and z to be specific rational numbers, then the output is ready after a few seconds. 
When only one of the parameters {x or z) is specialized, it takes about an hour of CPU time (on 
a Sun SPARC), but when both x and z are left alone. Maple ends with a memory fault (on our 
system). 

In this paper we will outline how to get around this problem, and how to produce absolutely rigorous 
WZ-proofs for each of these 18 identities (and possibly many others yet to be discovered, on which 
EKHAD might fail for lack of memory), using commonly available computers. However, while it 
is feasible to get such a rigorous proof in each case, it is very time-consuming, and takes about a 
week running on our system on nice. On the other hand, the very same method also produces 
semirigorous (see [Z]) proofs very fast. Since we know that the identities are true (even if Andrews 
did not have a proof), and we know that we can find a complete proof, if we only were willing to 
wait a week/identity, we don't see the point of actually wasting the Temple Math Department's 
computer resources. 

Any of our readers who wishes to have a complete proof is welcome to download our Maple package 
SYND, as well as any of the input files given in this paper's Web Page, on their own computer. 
Alternatively, we would be happy to run the program for a fee of $300/identity. Once the fee is 
paid, the program would be run, and once it terminates, the output would be published in the Web 
Page, and the name of the donor would be prominently displayed. This is a good way to honor 
friends and relatives, by naming the proof after them. Instructions on how to order a proof are 
given in the Web Page mentioned above. 

In order to demonstrate feasibility, we have run the program SYND on one of Andrews's new iden- 
tities: (4.4) of [Al]. The input and output files can be gotten from our Web page. 

A Short Review^ of Creative Telescoping and WZ-certification 

Let F{n, k) = F{n, k,ci, . . . ,Cr) he proper hypergeometric (see [PWZ] , p. 64 for definition) in all its 
arguments. The Fundamental theorem of [PWZ] guarantees that there exist a non-negative integer 
J, and polynomials aj{n) = aj{n,c-\_, . . . ,Cr), j = 0, . . . , J free of k, as well as another proper 
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hypergeometric term G{n,k) = G{n,k,ci, . . . ,Cr), such that G/F is a rational function in all its 
arguments, and such that: 

J 

Uj (n)F(n + j, k) = G{n, k + 1)- G{n, k) . {CT) 

3=0 

It follows immediately, by summing w.r.t. k, that the definite sum: 

A{n) = A{n,ci,. . . ,Cr) := -F(n, fc, ci, . . . , c^) 

k 

satisfies the linear recurrence equation with polynomial coefficients 

J 

aj {n)A{n + j) = . {Recurrence) 

3=0 

Hence, in order to prove a conjectured identity 

A{n) = B{n), 

where B{n) is a certain explicitly given expression, all we have to do is verify that B{n) also satisfies 
the same recurrence, i.e. 

J 

Y,aj{n)B{n + j) = Q , 

3=0 

then check that the leading coefficient aj{n) does not vanish at positive integers (if it does we just 
begin at the highest integer root, and check empirically the finite number of cases before). Finally 
we check that A{j) = B{j) for j = 0, . . . , J — 1. The identity A{n) = B{n) then would follow, by 
induction, for every positive integer value of n. 

The recurrence and certificate, G{n,k), featured in (CT) can be found with the Gosper-Zeilberger 
algorithm ( [P WZ] [GKP] ) . 

In this algorithm, in addition to the unknowns qq, . . . ,aj one is also looking for the coefficients, 6j, 
of a certain polynomial: 

K 

b{k;n, ci, ...,Cr) = ^6i(n,ci, . . . ,Cr)/c* 

i=0 

such that equation (6.1.3) of [PWZ], p. 108, holds, i.e. 

P2ik)b{k + l)-ps{k-l)b{k)=p{k) , {G-Z) 

for certain explicit polynomials p(k),p2{k),ps{k) that also depend on n, ci, . . . , c^., and that are 
derivable from the summand F{n,k). Furthermore p{k) involves the unknown coefficients of the 
recurrence aj (n) linearly. 
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The next step is to expand {G — Z) in powers of k and compare the coefHcients, getting a homo- 
geneous system of linear equations in the unknowns ao, ai, . . . , aj, 6o) • ■ • , ^x- If the system has 
no non-trivial sohitions, it means that J was too low, so we try again with J <— J + 1. We are 
guaranteed to succeed eventually by the Fundamental Theorem (see [PWZ]). 

Until Andrews's paper [Al], the above was as easily said as done, and EKHAD had no difficulty 
in proving any identity of the above form. Sure, the largest contemporary computer is probably 
unable to find the recurrence that 



satisfies, but for single sum summations with conjectured (or already known) explicit expressions, 
it never failed, as far as we know. Imagine our chagrin (and George Andrews's justified glee) at the 
failure of our program on his new identities. We should remark that the failure was not theoretical 
but practical: insufficient memory to handle the large objects encountered. 

The Bottle-Neck that Causes the Andrews Syndrome 

As we saw above, the heart of the Gosper-Zeilberger algorithm is the solving a certain system of 
linear equations with symbolic coefficients. When the summand -F(n, k) only depends on n and fc, 
and the extra arguments (parameters) ci,C2, . . . ,Cr are absent, then the entries of the coefficient 
matrix of the system are polynomials in the single variable n. However, when other parameters 
are present (in Andrews's case we have x and z in addition to n), then the entries are rather large 
polynomials of several variables (in Andrews's case of the three variables n^x,z). 

It turns out that, say for (4.4) of [Al], the number of unknowns is 8: aoi Oi, ^2, ^3 (the coefficients 
of the recurrence operator) and b(),bi,b2,b-i (the coefficients of the polynomial p{k)). In order to 
find the recurrence operator oq + aiN + a2iV^ + aaN^ (where N is the forward shift in the variable 
n: Nf{n) := /(n + 1)), and the polynomial b{k) = bo + bik + b2k^ + bsk^ , we have to solve a certain 
linear system: 



where M is an 8 x 8 matrix whose entries are certain (rather large) polynomials of the three variables 
n, X, z. As we said above, solving this system is beyond Maple's capabilities running on our system. 
Our readers axe welcome to try EKHAD on any of Andrews's identities on their system. The input 
files for EKHAD (that failed on our system, but may work on bigger ones) are given in this paper's 
Web page. We would be happy to hear of reports of any successful run, and we would announce 
them on that very same Web page. 

A general WZ procedure for lowering the order of a recurrence by 1 

Suppose X^fc -f^(?^, fc) = r{n) is some identity that we wish to prove. Recall the WZ paradigm, 
which says to divide by the right hand side, and to try to prove instead the identity /(n) := 
^].{F{n,k)lr{n)) = 1. Well, here is another possible advantage of this paradigm, which was 
pointed out by Herb Wilf: since the recurrence formula that the left side satisfies must have the 




M{ao,ai,a2,a3,bo,bi,b2,b3)'^ = 



{Linear System) 



4 



solution /(n) = 1, it follows that the recurrence operator will always have a right factor of AT — 1, 
where N is the forward shift in n. That being the case we can look for the recurrence that is 
satisfied by (N — l)/(n), and it will be of one lower order than the original. To look for that 
recurrence, let's rename the summand F(n, k)/r{n), and call it -F(n, k) again. Then what we want 
to do is to apply the Zeilberger algorithm to the summand F{n + 1, k) — F{n, k), instead of applying 
it to F{n, k) itself. We obtain a recurrence of order 1 less than we would otherwise have found, 
and we then need to show that it has only the trivial solution, which we do by displaying enough 
initial zero values. This procedure is perfectly general, and it applies in any situation where the 
form of the conjectured sum is known. 

Getting Around the Andrews Syndrome 

Consider again an identity of the form 

^F{n,k,ci, ...,Cr) = B{n,ci, ...,Cr) . 
fc 

As above, we make the right side 1, by dividing by Bin). Renaming F <— F/B, we have to prove 
an identity of the form (suppressing the dependence on ci, . . . , c^): 

A{n) ■.= ^F{n,k) = 1 . 

k 

It is easy to check whether ^4(0) = 1. hence it suffices to prove that 

A{n):=A{n+l)-A{n):=Y^[F{n+l,k)-F{n,k)]=0 . (Efes) 

k 

In 99% of the cases, one can apply Gosper's algorithm to the summand F{n + l,k) — F(n, k) 
(with respect to k, see [PWZ], chapter 7), and get another hyper geometric term G(n,k) (which 
furthermore can be written as R{n, k)F{n, k), where R{n, k) is a rational function), such that 

F{n + l,k) - F{n,k) = G{n,k + l) -G{n,k) . (WZ) 

This is the WZ-equation. Summing (WZ) with respect to k immediately yields (Efes). Summing 
(WZ) with respect to n yields the so-called companion identity (see ibid.). 

However, in the remaining %1 of the cases, that include Andrews's twenty identities discussed 
here, Gosper's algorithm tells us that no such (hypcrgcomctric) G(n, k) exists. The more general 
Gosper-Zeilberger algorithm guarantees, however (writing F{n,k) := F{n + l,fc) — F(n,k)) that 
there exist a non-negative integer J, and oq, . . . , aj, and G{n, k) such that 

J 

J2aj{n)F{n+j,k)=G{n,k+l)-G{n,k) . {CT') 

3=0 
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As before, summing with respect to A;, we get: 

J 

^^aj{n)A{n + j) = . {Recurrence') 

3=0 

Once these are found it would follows that A{n) = provided it is so for n = 0, . . . , J — 1. 

Note that the 'J' of {Recurrence') is one less than that of {Recurrence), since considering A{n) 
rather than A{n) reduces the order of the recurrence by 1. 

While the above process reduces the number of unknowns by 1, the resulting symbolic linear system 
still proves too big for Maple to handle. And now comes the second twist on the Gosper-Zeilberger 
algorithm. In order to prove that A{n) = 0, we don't actually have to know what the oq, ai, . . . , aj 
and the bo,-- - ,bK are exactly! All we have to know is that they exist!, plus we have to make sure 
that the leading coefficient aj{n) of the recurrence does not vanish on any positive integer value of 
n (or if it does, then we have to know the highest such value). 

The existence of a non-trivial solution of the homogeneous linear system {Linear System) (when it 
now applies to the modified sum as above), is exactly equivalent to the fact that the determinant 
of the square matrix M vanishes. This determinant is a polynomial in the variables n,ci, - - - ,Cr. 
If we can find a priori bounds for the degrees in each of its variables, then plug in enough special 
cases (the number of which should be at least equal to the product of 1 plus the degrees in each of 
the variables) into this determinant, and evaluate these numerous, but fast-to-compute resulting 
numerical determinants, and check whether they are always 0, we would have a completely rigorous 
proof. 

As hinted in [Z], if you take a non-zero polynomial out of the blue and plug in random values, it 
is extremely unlikely that it would be zero. It such a polynomial yields zero for, say, a hundred 
different tries, then it is much more likely that the entire framework of mathematics is flawed than 
that this particular polynomial is non-zero. Hence we have a situation where in order to prove an 
identity we have the costly, but feasible, option of a completely rigorous proof, and an extremely 
fast and inexpensive way to prove it with probability 1 — e. In the package SYND, that comes with 
this paper, and that can be downloaded from its Web Page, the procedure PR00F2 has Certainty 
as one of its arguments, that can be adjusted by the user. Setting Certainty:= 1 yields a rigorous, 
but time-consuming, proof. Setting it, to say 0.1 would already yield a 99.9999%-sure proof very 
fast. 

So now we know (either for sure, or almost surely) that the ao{n, ci, . . . , c^), . . . , aj{n, ci, . . . , c^), 
and the G{n,k) in Eq. {CT') exist, and hence Eq. {Recurrence') holds, for some Oq, . . . ,aj. 
We don't really care what they are except, as mentioned above, we have to make sure that 
aj{n,ci, . . . ,Cr) does not vanish on positive integers (and if it does, then to find the largest such). 
But this is easy and fast. If aj{n,ci, . . . ,Cr) would have vanished at n = no, then (n — no) would 
have been a factor, and it still would have been a factor of aj{n, ci, - . . , c°), for any specialization 
ci = ci,. -- ,Cr = c°. So all we have to do is apply procedure ct or zeil of EKHAD to any such 



6 



specialization, and make sure that the polynomial oj does not have a factor of the form (n — no). 
Now the summand only depends on two symbols: n, and k, and ct runs very fast (at least on 
Andrews's identities discussed here). 

A Priori Bounds For the Degrees of the Determinant 

In order to prove that a polynomial P{xi, . . . , Xm) is the zero polynomial, by plugging in sufficiently 
many values for its arguments, we need a priori bounds for its degrees in each and every one of 
its arguments. If its degree in Xi is di, then checking at all the integer points — (di + l)/2 < Xi < 
{di + l)/2 (i = 1, . . . , m), and verifying that the value of P is zero at each of these points, would 
constitute a rigorous proof that P is the zero polynomial. 

In the present scenario, the polynomial P(xi, . . . , Xm) is given as a determinant P := det(ai^j(xi, . . . , x^)), 
where (a^j) is a square matrix. To get an upper bound (usually very dull), for the degree of the 
determinant in the variable x^, we replace each entry ai^j by its leading term with respect to Xi. 
For example Ax'^z^ + 3x^z^ + f>xz would be replaced by 4x'^x^ when the degree in x is sought, and 
by 3x^z^ when the degree in z is desired. Next, we take the permanent. Since the permanent of 
a matrix with monomial entries can't have any cancellations, the degree in the variable Xi of the 
resulting permanent would definitely constitute an upper bound for di. 

The Package Synd 

The present method (or rather the present twist on the old method) is implemented by the Maple 
package SYND, that accompanies this paper. It can be downloaded from either 



tittp : //www. math, temple . edu/~ zeilberg/synd.html (download SYND), or from 



|f tp : //ftp .math, temple . edu/pub/zeilberg/programs , or by anonymous ftp to ftp . math . temple . edu 



(login as anonymous, password as instructed (your E-mail address), then cd pub/zeilberg/programs 
<CR>, followed by, get SYND <CR>.To exit ftp you type: quit <CR>. 

Once you have SYND on your own computer, make sure that you have Maple (if you don't, get it!), 
and that you are in the directory where SYND is. Then get into Maple by typing maple <CR>. Once 
inside Maple, type read SYND; <CR>, and get the on-line help by typing ezraO ;CR. 

The two main procedures are PROOFl and PRD0F2. The former handles sums with one extra pa- 
rameter, while the latter handles sums, like those of Andrews[Al], with two extra parameters (x 
and z). We did not bother to write the analogous procedures for more auxiliary parameters, but 
the Maple-literate reader can do it with no trouble. 

The function cah for PROOFl is: 

PROOFl (SUMMAND , RHS , k , n , LowerLimit , UpperLimit , a , Certainty) ; 

Here SUMMAND is the hypergeometric summand, given either in terms of factorials, or binomial 
coefficients, or rf, where rf (a,k) is the raising factorial (a)^ := a{a + \) ■ ■ ■ {a + k — 1); RHS 
is the conjectured right hand side; k is the (single-) summation variable; n is the variable with 
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respect to which the recurrence is desired; LowerLimit and UpperLimit are respectively where 
the summation starts and ends; a is the auxihary parameter; Finally, Certainty is the rigor-level. 
Setting Certainty : =1 ; , would yield a rigorous proof. Setting it to anything less would give a 
semi-rigorous proof. Be warned that Certainty=0 . 1 does not mean that the probability that the 
identity is true is %10! It is probably true with probability %99. 999999. The meaning of Certainty 
is the fraction of trials that we attempt, compared to the full intervals in n and in a, that is needed 
for a rigorous proof. 

Example: While EKHAD can handle the Chu-Vandermonde identity 




in a few seconds, just for the sake of example, here is the function call that does it in SYND. Since 
Certainty is set to 1, this would yield a completely rigorous proof. 

PROOFl (binomial (a, k) *binomial(n,k) ,binomial(a+n,a) ,k,n,0,n,a, 1) ; 

PR00F2 is exactly as above, except that we have to specify the two auxiliary parameters: x, and z. 
The function call for PR00F2 is: 

PR00F2(SUMMAND,RHS,k,n, LowerLimit, UpperLimit, x,z, Certainty) ; 
where x and z are the auxiliary parameters. For example the Dixon identity 

' \a + kj\n + kj\b + kj ~ a\b\n\ ' 
is proved, by SYND, with the following call: 

PR00F2((-l)**k*binomial(a+b,a+k)*binomial(a+n,n+k)*binomial(b+n,b+k) , (a+b+n) !/a!/b!/n! ,k,n, 
-n,n,a,b, 1) ; 

(Once again this is only for the sake of example, since EKHAD does it effortlessly). 
Andrews's 20 Identities 

The input files for all the identities of section 4 of [Al], as well as the corresponding output files 
(with Certainty=0 . l), can be retrieved from the Web page. As we said above, the only identity for 
which we ran the program with Certainty=l is identity (4.4). Readers who have Maple and have 
idle time to spare, are welcome to download any or all the input files, change the last argument of 
PR00F2 to 1 (instead of 0.1), and run it on their computer. We request that you notify us, so that 
we can announce that the WZ-style rigorous proof of the given identity has been performed, with 
due acknowledgement to the computer and the human owner. 
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Lily Yen's Method 



Another possibility, that might work, of getting around the Andrews syndrome is to adapt Lily 
Yen's [Y] beautiful approach. Yen found an a priori bound, L , easily derivable from the identity, 
such that if the identity is true for n = 0, 1, . . . , L, then it would be true for all n. Unfortunately, 
it is so enormous that at present it only has theoretical interest. However, it seems to us that 
an appropriate modification to the situation described in the present paper, where one has extra 
parameters, would yield quite small and practical upper bounds. 

The reason for the gargantuan size of Yen's upper bound is having to bound the largest integer 
root of the leading coefficient aj(n). The upper bound for the order of the recurrence, J, is quite 
small. Since now we have extra parameters, we can itiIc out the possibility of positive integer roots 
as above, by running ct of EKHAD on a specialization. It would be interesting and useful to make 
this precise, and to implement it. 

Peter Paule's Method 

Andrews's identities are also interesting from the WZ theory point of view. The orders of the recur- 
rences outputted by EKHAD (3 and 2) are two higher than the orders of the minimal recurrences 
(1 and 0) satisfied by the right sides. 

They provide many new non-trivial examples to the phenomenon described on p. 117 of [PWZ]. 
This phenomenon, of not getting the minimal recurrence, is much more widespread for q-series, in 
which Peter Paule[P] introduced a very useful order-reducing preprocessing device. Paule's method 
is also useful for ordinary hypergeometric sums. 

At present it is not clear how to apply Paule's method to Andrews's sums, but wc suspect that an 
appropriate generalization will do the job. Perhaps Andrews's identities are limiting cases of more 
general identities that come from multiple sums, on which there would be some obvious symmetry 
group with respect to which one would be able to apply Paule-symmetrization. 

Conclusion 

Our Wise men, let their memory be blessed said: 'Kinat Sofrim Tarbe Khochma', which, roughly 
means: 'Rivalry among scholars increases knowledge'. In the present case the rivalry is between 
human and machine. While the meta-mathematical debate engendered in [Z] and [A4] is unlikely 
to be resolved in our time, mathematics proper does benefit. 

The second moral is that mistakes are crucial for progress. According to the current dogma in 
biology, we humans would have still been amoebas if not for a series of lucky mistakes in biological 
transmission of information. The same can be said for science, and even for mathematics. If the 
previous version of EKHAD did not contain a bug, George Andrews would have obtained the proof 
of {MRR) right away, and would never have needed to find another proof, that lead to twenty 
new beautiful identities. While the WZ method is also capable of discovering (and proving at the 
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same time) new identities out of old ones, we do not know at present how to rediscover, naturally, 
Andrews's identities from scratch. 



Bruno Salvy has told us that the question of deciding how many random checks of the vanishing 
of a polynomial are needed in order to deduce its nullity, within a prescribed margin of error, has 
been dealt with. See his message[B] that can be accessed from this paper's Web page. 

One last point. The present method is easy to parallelize. This is because it boils down to checking 
many special instances, that can be done independently of each other. If the need would arise in 
the future to prove a huge identity (for example because it would imply the Riemann Hypothesis), 
then it would be feasible to have an International collaboration of many computers. 
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